Canonicalization provides an architecture-agnostic method for enforcing equivariance, with generalizations such as frame-averaging recently gaining prominence as a lightweight and flexible alternative to equivariant architectures. Recent works have found an empirical benefit to using probabilistic frames instead, which learn weighted distributions over group elements. In this work, we provide strong theoretical justification for this phenomenon: for commonly-used groups, there is no efficiently computable choice of frame that preserves continuity of the function being averaged. In other words, unweighted frame-averaging can turn a smooth, non-symmetric function into a discontinuous, symmetric function. To address this fundamental robustness problem, we formally define and construct weighted frames, which provably preserve continuity, and demonstrate their utility by constructing efficient and continuous weighted frames for the actions of SO(d), O(d), and S_n on point clouds.
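For context, here is a minimal sketch of the standard (unweighted) frame-averaging construction and its weighted variant; the notation is assumed for illustration rather than taken from the paper (F(x) denotes the frame at input x, \rho the group action on outputs, and w the learned weights):

$$
\langle f \rangle_{F}(x) \;=\; \frac{1}{|F(x)|} \sum_{g \in F(x)} \rho(g)\, f\!\left(g^{-1} \cdot x\right),
\qquad
\langle f \rangle_{F,w}(x) \;=\; \sum_{g \in F(x)} w(g, x)\, \rho(g)\, f\!\left(g^{-1} \cdot x\right),
$$

with \(\sum_{g \in F(x)} w(g, x) = 1\). The point of the weighted average is that it can vary continuously in \(x\) even when the frame \(F(x)\) itself changes discontinuously, which the uniform average cannot guarantee.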
We study the problem of learning equivariant neural networks via gradient descent. The incorporation of known symmetries ("equivariance") into neural nets has empirically improved the performance of learning pipelines, in domains ranging from biology to computer vision. However, a rich yet separate line of learning theoretic research has demonstrated that actually learning shallow, fully-connected (i.e. non-symmetric) networks has exponential complexity in the correlational statistical query (CSQ) model, a framework encompassing gradient descent. In this work, we ask: are known problem symmetries sufficient to alleviate the fundamental hardness of learning neural nets with gradient descent? We answer this question in the negative. In particular, we give lower bounds for shallow graph neural networks, convolutional networks, invariant polynomials, and frame-averaged networks for permutation subgroups, which all scale either superpolynomially or exponentially in the relevant input dimension. Therefore, in spite of the significant inductive bias imparted via symmetry, actually learning the complete classes of functions represented by equivariant neural networks via gradient descent remains hard.
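For reference, a sketch of the correlational statistical query (CSQ) model in which such lower bounds are stated; the notation (target f, data distribution D, query h, tolerance \tau) is assumed here for illustration. The learner repeatedly submits a bounded query function h and a tolerance \(\tau > 0\), and the oracle may return any value v satisfying

$$
\left| \, v \;-\; \mathbb{E}_{x \sim D}\!\left[ h(x)\, f(x) \right] \right| \;\le\; \tau .
$$

Gradient descent on the square loss can be simulated with queries of this form, which is why CSQ lower bounds constrain gradient-based training, as the abstract notes.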